blog: why cardinality problems show up too late by yupme-bot · Pull Request #2915 · prometheus/docs

yupme-bot · 2026-03-24T15:55:20Z

Draft blog post based on recent discussion around high-cardinality patterns in instrumentation.

Focuses on where issues are introduced vs where they become visible in the pipeline.

Happy to expand on any specific examples from that discussion if useful during review.

Draft blog post based on recent discussion around high-cardinality patterns in instrumentation. Focuses on where issues are introduced vs where they become visible in the pipeline. Happy to expand on any specific examples from that discussion if useful during review. Signed-off-by: yupme-bot <yupme112@gmail.com>

nwanduka

Thanks for working on this, @yupme-bot 👌. I’ve reviewed it for grammar and clarity, and it looks good to me.

@bboreham, when you get the chance, could you please review it for technical accuracy?

nwanduka · 2026-03-24T16:43:58Z

+
+This creates a gap in the pipeline:
+
+```


I wonder how this section will be rendered on the blog.

bboreham

Overall I think this is reasonable, but I made a number of comments while reading.

Mostly I think you should explain more: write to an audience that isn't already immersed in the detail.

bboreham · 2026-05-11T19:45:48Z

+date: 2026-03-23
+---
+
+High-cardinality metrics are a well-known problem in Prometheus. Most people are familiar with the guidance: avoid labels with unbounded values like user IDs, request IDs, or full URL paths.


I think you should define the word "Cardinality" in the blog post, or post a link to somewhere that explains the way in which you are using the word.
For instance it's not this one: https://en.wikipedia.org/wiki/Cardinality_(data_modeling).

bboreham · 2026-05-11T19:46:57Z

+
+## The symptom: cardinality shows up late
+
+In Prometheus, every unique combination of label values creates a new time series. When high-entropy values are used as labels, the number of series can grow quickly.


"high-entropy" is another jargon term that deserves a definition.
Earlier you used "unbounded" to mean (I think) the same thing.

bboreham · 2026-05-11T19:48:17Z

+
+The root cause is usually not in Prometheus itself, but earlier in the pipeline.
+
+With OpenTelemetry-style instrumentation, it is very easy to attach rich context as attributes:


What about this problem is specific to OpenTelemetry?
Did you mean more that it is a general problem, but you are using OpenTelemetry to illustrate?

bboreham · 2026-05-11T19:48:35Z

+
+Nothing about this looks wrong in isolation.
+
+But if those values end up as labels downstream, each distinct value becomes a new time series.


Why wouldn't they end up as labels? Does that even work?

bboreham · 2026-05-11T19:49:24Z

+
+But if those values end up as labels downstream, each distinct value becomes a new time series.
+
+A single line attaching a highly variable value as an attribute can look completely reasonable in a code review.


"highly variable" is another synonym for "unbounded" and "high-entropy"?

bboreham · 2026-05-11T19:50:07Z

+
+### Guidance exists, but is not visible at the right time
+
+The docs are clear about avoiding high-cardinality labels, but that guidance is not always present when writing instrumentation.


Suggest to link to the specific place in the docs.

bboreham · 2026-05-11T19:54:31Z

+
+By the time it is noticed, it is often already affecting production systems.
+
+Most solutions today focus on reducing the impact—normalizing values, limiting label sets, or making attribute-to-label conversion opt-in.


"Most solutions today" sets up in my mind an expectation that you will present something different in this post. But I don't think that expectation is met; these seem to be the solutions you present.

bboreham · 2026-05-11T19:55:08Z

+- **Be careful with attribute-to-label conversion**  
+  Not every attribute needs to become a metric label.


I'm not sure what this means. Can you include a link to where readers could find out more?

bboreham · 2026-05-11T19:58:57Z

+- **Treat cardinality as a design concern**  
+  It is much easier to avoid these issues up front than to fix them later.


Can you give, or point to, some specific ways in which to go about this?
E.g. multiply together the cardinality of each independent label.

nwanduka reviewed Mar 24, 2026

View reviewed changes

bboreham reviewed May 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

blog: why cardinality problems show up too late#2915

blog: why cardinality problems show up too late#2915
yupme-bot wants to merge 1 commit into
prometheus:mainfrom
yupme-bot:yupme-bot-patch-1

yupme-bot commented Mar 24, 2026

Uh oh!

nwanduka left a comment

Uh oh!

nwanduka Mar 24, 2026

Uh oh!

bboreham left a comment

Uh oh!

bboreham May 11, 2026

Uh oh!

bboreham May 11, 2026

Uh oh!

bboreham May 11, 2026

Uh oh!

bboreham May 11, 2026

Uh oh!

bboreham May 11, 2026

Uh oh!

bboreham May 11, 2026

Uh oh!

bboreham May 11, 2026

Uh oh!

bboreham May 11, 2026

Uh oh!

bboreham May 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		## The symptom: cardinality shows up late

		In Prometheus, every unique combination of label values creates a new time series. When high-entropy values are used as labels, the number of series can grow quickly.


		The root cause is usually not in Prometheus itself, but earlier in the pipeline.

		With OpenTelemetry-style instrumentation, it is very easy to attach rich context as attributes:


		Nothing about this looks wrong in isolation.

		But if those values end up as labels downstream, each distinct value becomes a new time series.


		But if those values end up as labels downstream, each distinct value becomes a new time series.

		A single line attaching a highly variable value as an attribute can look completely reasonable in a code review.


		### Guidance exists, but is not visible at the right time

		The docs are clear about avoiding high-cardinality labels, but that guidance is not always present when writing instrumentation.


		By the time it is noticed, it is often already affecting production systems.

		Most solutions today focus on reducing the impact—normalizing values, limiting label sets, or making attribute-to-label conversion opt-in.

		- Be careful with attribute-to-label conversion
		Not every attribute needs to become a metric label.

		- Treat cardinality as a design concern
		It is much easier to avoid these issues up front than to fix them later.

Conversation

yupme-bot commented Mar 24, 2026

Uh oh!

nwanduka left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

bboreham left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants